Typical Sequences for Polish Alphabets
Abstract
The notion of typical sequences plays a key role in information theory. Central to the idea of typicality is that a sequence x_1, x_2, ..., x_n that is P_X-typical should, loosely speaking, have an empirical distribution that is in some sense close to the distribution P_X. The two most common notions of typicality are strong (letter) typicality and weak (entropy) typicality. While weak typicality supports many of the arguments that can be made with strong typicality, some arguments for strong typicality cannot be generalized to weak typicality. In this paper, we consider an alternate definition of typicality, one based on the weak* topology and applicable to Polish alphabets (which include R). This notion generalizes strong typicality in the sense that it reduces to strong typicality in the finite-alphabet case, and it can also be applied to mixed and continuous distributions. Furthermore, it is strong enough to prove a Markov lemma, and thus can be used to directly prove a more general class of results than weak typicality. As an example of this technique, we directly prove achievability for Gel’fand-Pinsker channels with input constraints for a large class of alphabets and channels, without first proving a finite-alphabet result and then resorting to delicate quantization arguments. While this large class does not include Gaussian distributions with power constraints, it is shown to be straightforward to recover this case by considering a sequence of truncated Gaussian distributions.

Index Terms: Typical sequences, weak* topology, capacity, Gel’fand-Pinsker.
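To make the gap between the two classical notions concrete, the following is a minimal Python sketch for the finite-alphabet case only. It does not implement the weak*-topology definition proposed in the paper. The function names and the particular variant of strong typicality used here (relative deviation |N(a|x)/n - P(a)| <= eps * P(a) for every letter a) are assumptions made for illustration; several slightly different definitions appear in the literature.

```python
import math
from collections import Counter


def empirical_distribution(seq):
    """Return the relative frequency of each symbol in seq."""
    n = len(seq)
    return {a: c / n for a, c in Counter(seq).items()}


def entropy(p):
    """Shannon entropy of the distribution p, in bits."""
    return -sum(pa * math.log2(pa) for pa in p.values() if pa > 0)


def is_strongly_typical(seq, p, eps):
    """Letter typicality (assumed variant): every symbol's empirical
    frequency is within a relative eps of its probability under p."""
    emp = empirical_distribution(seq)
    # A symbol with zero probability must not appear at all.
    if any(p.get(a, 0.0) == 0.0 for a in emp):
        return False
    return all(abs(emp.get(a, 0.0) - pa) <= eps * pa for a, pa in p.items())


def is_weakly_typical(seq, p, eps):
    """Entropy typicality: the normalized negative log-likelihood of seq
    under p is within eps of the entropy of p."""
    if any(p.get(a, 0.0) == 0.0 for a in seq):
        return False
    sample_entropy = -sum(math.log2(p[a]) for a in seq) / len(seq)
    return abs(sample_entropy - entropy(p)) <= eps


# Hypothetical example: a fair binary source.
p = {"a": 0.5, "b": 0.5}
print(is_weakly_typical("aaaa", p, 0.1))    # constant sequence, weak test
print(is_strongly_typical("aaaa", p, 0.1))  # constant sequence, strong test
print(is_strongly_typical("abab", p, 0.1))  # balanced sequence, strong test
```

For a uniform source every sequence has the same likelihood, so the weak test accepts even the constant sequence, while the strong test rejects it because its empirical distribution is far from P_X. This is one simple way to see why some strong-typicality arguments do not carry over to weak typicality.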
Similar resources
Practical Rank/Select Queries over Arbitrary Sequences
We present a practical study on the compact representation of sequences supporting rank, select, and access queries. While there are several theoretical solutions to the problem, only a few have been tried out, and there is little idea on how the others would perform, especially in the case of sequences with very large alphabets. We first present a new practical implementation of the compressed...
Reduced amino acid alphabets exhibit an improved sensitivity and selectivity in fold assignment
MOTIVATION Many proteins with vastly dissimilar sequences are found to share a common fold, as evidenced in the wealth of structures now available in the Protein Data Bank. One idea that has found success in various applications is the concept of a reduced amino acid alphabet, wherein similar amino acids are clustered together. Given the structural similarity exhibited by many apparently dissim...
Typicality Graphs: Large Deviation Analysis
Let X and Y be finite alphabets and PXY a joint distribution over them, with PX and PY representing the marginals. For any ε > 0, the set of n-length sequences x and y that are jointly typical [1] according to PXY can be represented on a bipartite graph. We present a formal definition of such a graph, known as a typicality graph, and study some of its properties.
Statistics of RNA secondary structures.
A statistical reference for RNA secondary structures with minimum free energies is computed by folding large ensembles of random RNA sequences. Four nucleotide alphabets are used: two binary alphabets, AU and GC, the biophysical AUGC and the synthetic GCXK alphabet. RNA secondary structures are made of structural elements, such as stacks, loops, joints, and free ends. Statistical properties of ...
A formalism for studying long-range correlations in many-alphabets sequences
S. L. Narasimhan, Joseph A. Nathan, P. S. R. Krishna and K. P. N. Murthy Solid State Physics Division, Reactor Physics Design Division Bhabha Atomic Research Centre, Mumbai-400085, India. Materials Science Division, Indira Gandhi Centre for Atomic Research, Kalpakkam 603102, Tamilnadu, India. Abstract We formulate a mean-field-like theory of long-range correlated L-alphabets sequences, which ar...
Journal title: CoRR
Volume: abs/1005.2321
Publication date: 2010